Panel Tracking for the Extraction and the Classification of Speech Balloons

نویسندگان

  • Hadi S. Jomaa
  • Mariette Awad
  • Lina Ghaibeh
چکیده

Searching for texts inside a full comic strip may be exhaustive, and can be simplified by restricting the scope of the search to single panels, and better yet to within individual speech balloon. In this paper, a novel approach is devised where a tracking algorithm is employed for panel extraction, and speech balloons are identified using ‘Roberts’ edge detection operator as well as a classifier to find the number of balloons within every panel using a nonexhaustive projection method. Two main objectives in the field of comic strip understanding are achieved through our panel tracking for the extraction and classification of speech balloons (PaTEC). PaTEC may be incorporated as a precursor to text extraction and recognition reducing the computational time and effort of searching the whole image to the speech balloon area itself. PaTEC accuracy for panel extraction is 88.78% while balloon classification accuracy is 81.49% on a homegrown comic database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain

This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...

متن کامل

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

متن کامل

An Intelligent PV Panel Structure to Extract the Maximum Power in Mismatch Irradiance

a new intelligent photovoltaic (PV) panel structure to extract the maximum power in mismatch irradiance is proposed. In conventional structures, difference of irradiance between series panels can cause the deviation of maximum power point. In this condition tracking MPP becomes difficult and reduces efficiency. Improvements in power electronics and its effects in PV industrial technology, devel...

متن کامل

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

متن کامل

Improving the Tracking Error Signal Extraction in IR Seeker with Stationary Wagon Wheel Reticle over all Field of View

The accuracy of target position detection in IR seeker depends on the accuracy of tracking error signal (TES) extraction from seeker Field of View (FOV). The type of reticle inside the seeker determines the output modulation signal that carries the TES. In this paper, the stationary wagon wheel reticle is used, which makes the type of the output signal as FM modulation in the linear region of F...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015